European Radiology
○ Springer Science and Business Media LLC
Preprints posted in the last 30 days, ranked by how well they match European Radiology's content profile, based on 14 papers previously published here. The average preprint has a 0.04% match score for this journal, so anything above that is already an above-average fit.
Chaves, E. T.; Teunis, J. T.; Digmayer Romero, V. H.; van Nistelrooij, N.; Vinayahalingam, S.; Sezen-Hulsmans, D.; Mendes, F. M.; Huysmans, M.-C.; Cenci, M. S.; Lima, G. d. S.
Show abstract
Background: Radiographic detection of caries lesions adjacent to restorations is challenging due to limitations of two-dimensional imaging and difficulties distinguishing true lesions from restorative or anatomical radiolucencies. Artificial intelligence (AI)-based clinical decision support systems (CDSSs) have been introduced to assist radiographic interpretation; however, different AI tools may yield variable diagnostic outputs, and their comparative performance remains unclear. Objective: To compare the diagnostic performance of commercial and experimental AI algorithms for detecting secondary caries lesions on bitewings. Methods: This cross-sectional diagnostic accuracy study included 200 anonymized bitewings comprising 885 restored tooth surfaces. A consensus group reference standard identified all surfaces with a caries lesion and classified each lesion by type (primary/secondary) and depth (enamel-only/dentin-involved). Five commercial (Second Opinion, CranioCatch, Diagnocat, DIO Inteligencia, and Align X-ray Insights) and three experimental (Mask R-CNN-based and Mask DINO-based) systems were tested. Diagnostic performance was expressed through sensitivity, specificity, and overall accuracy (95% CI). Comparisons used generalized estimating equations, adjusted for clustered data. Results: Specificity was high across all systems (0.957-0.986), confirming accurate recognition of non-carious surfaces, whereas sensitivity was moderate (0.327-0.487), reflecting frequent missed detections of enamel and dentin lesions. Accuracy ranged from 0.882 to 0.917, with no significant differences among models (p >= 0.05). Confounding factors, such as radiographic overlapping, marginal restoration defects, and cervical artifacts, were the main sources of misclassification. Conclusions: AI algorithms, regardless of architecture or commercial status, showed similar diagnostic capabilities and a conservative detection profile, favoring specificity over sensitivity. Improvements in dataset diversity, labeling precision, and explainability may further enhance reliability for secondary caries detection. Clinical Significance: AI-based CDSSs assist clinicians by providing consistent detection. Their high specificity is particularly valuable in minimizing unnecessary invasive treatments (overtreatment), though they should be used as adjuncts rather than a replacement for expert judgment.
de Boer, S.; Häntze, H.; Ziegelmayer, S.; van Ginneken, B.; Prokop, M.; Bressem, K. K.; Hering, A.
Show abstract
Background: Medical imaging, especially computed tomography and magnetic resonance imaging, is essential in clinical care of patients with renal cell carcinoma (RCC). Artificial intelligence (AI) research into computer-aided diagnosis, staging and treatment planning needs curated and annotated datasets. Across literature, The Cancer Genome Atlas (TCGA) datasets are widely used for model training and validation. However, re-annotation is often necessary due to limited access to public annotations, raising entry barriers and hindering comparison with prior work. Methods: We screened 1915 CT scans from three TCGA-RCC databases and employed a segmentation model to annotate kidney lesion. After a meta-data-based exclusion step, we hosted a reader study with all papillary (n=56), chromophobe (n=27) and 200 randomly selected clear cell RCC cases. Two students quality checked and corrected the data as well as annotated tumors and cysts. Uncertain cases were checked by a board-certified radiologist. Results: After data exclusion and quality control a total of 142 annotated CT scans from 101 patients (26 female, 75 male, mean age 56 years) remained. This includes 95 CTs with clear cell RCC, 29 with papillary RCC and 18 with chromophobe RCC. Images and voxel-level annotations of kidneys and lesions are open sourced at https://zenodo.org/records/19630298. Conclusion: By making the annotations open-source, we encourage accessible and reproducible AI research for renal cell carcinoma. We invite other researchers who have previously annotated any of these cohorts to share their annotations.
Alqaderi, H.; Kapadia, U.; Brahmbhatt, Y.; Papathanasiou, A.; Rodgers, D.; Arsenault, P.; Cardarelli, J.; Zavras, A.; Li, H.
Show abstract
BackgroundDental caries and periodontal disease represent the most prevalent global oral health conditions, collectively affecting several billion people. The diagnostic interpretation of dental radiographs, a cornerstone of modern dentistry, is associated with considerable inter-observer variability. In routine clinical practice, clinicians are required to evaluate a high volume of radiographic images daily, a cognitively demanding task in which diagnostic fatigue, time constraints, and the inherent complexity of overlapping anatomical structures can lead to the inadvertent oversight of early-stage pathologies. Artificial intelligence (AI) offers a transformative opportunity to augment clinical decision-making by providing rapid, objective, and consistent radiographic analysis, thereby serving as a tireless adjunct capable of flagging findings that may be missed during routine human inspection. MethodsThis study developed and validated a deep learning system for the automated detection of dental caries and alveolar bone loss using a dataset of 1,063 periapical and bitewing radiographs. Two separate YOLOv8s object detection models were trained and evaluated using a rigorous 5-fold cross-validation methodology. To align with the clinical use-case of a screening tool where high sensitivity is paramount, a custom image-level evaluation criterion was employed: a true positive was recorded if any predicted bounding box had a Jaccard Index (IoU) > 0 with any ground truth annotation. Model performance was systematically evaluated at confidence thresholds of 0.10 and 0.05. ResultsAt a confidence threshold of 0.05, the caries detection model achieved a mean precision of 84.41% ({+/-}0.72%), recall of 85.97% ({+/-}4.72%), and an F1-score of 85.13% ({+/-}2.61%). The alveolar bone loss model demonstrated exceptionally high performance, with a mean precision of 95.47% ({+/-}0.94%), recall of 98.60% ({+/-}0.49%), and an F1-score of 97.00% ({+/-}0.46%). ConclusionThe YOLOv8-based models demonstrated high accuracy and high sensitivity for detecting dental caries and alveolar bone loss on periapical radiographs. The system shows significant potential as a reliable automated assistant for dental practitioners, helping to improve diagnostic consistency, reduce the risk of missed pathology, and ultimately enhance the standard of patient care.
Authamayou, B.; Marnat, G.; Matsulevits, A.; Munsch, F.; Lavielle, A.; Courbin, N.; Foulon, C.; Chen, B.; Micard, E.; Gory, B.; L'Allinec, V.; Bourcier, R.; Naggara, O.; Lauze, E.; Boulouis, G.; Lapergue, B.; Eker, O.; Sibon, I. P.; Thiebaut de Schotten, M.; Tourdias, T.
Show abstract
BackgroundAcute basilar artery occlusion (BAO) causes devastating strokes. Despite the benefit of endovascular treatment, the optimal management remains sometimes controversial, such as for patients with mild deficits, and would benefit from robust prognostic tools. Given the dense white matter networks within the posterior fossa, we tested whether quantifying disconnections from acute diffusion-weighted imaging (DWI) could improve outcome prediction and responders to recanalization compared with conventional metrics. MethodsWe conducted a secondary analysis from a prospective multicenter stroke registry, including consecutive patients (2017-2024) with BAO and admission MRI. Ultra-high-resolution diffusion MRI was acquired in healthy participants to build normative tractograms with optimized posterior fossa quality. Patient infarcts delineated on DWI were projected onto these tractograms to estimate disconnected fiber volume. The primary outcome was 90-day modified Rankin Scale (mRS) 0-3 vs 4-6. Predictive performance of disconnected fiber volume was compared with baseline NIHSS, infarct volume, and posterior circulation ASPECTS (pc-ASPECTS) using logistic regressions and areas under receiver operating characteristic curves (AUC). Ordinal regressions tested associations across the full mRS spectrum, stratified by recanalization status. Analyses were repeated in patients with NIHSS [≤]10. ResultsAmong 201 patients (median age 70; NIHSS 10), 97 (48.3%) had poor outcome. Despite small median infarct volume (4.75 mL), disconnected fiber volume was substantial (median 25.15 mL). Disconnected fiber volume achieved an AUC of 0.84, outperforming NIHSS (0.67; p<0.0001), infarct volume (0.75; p=0.00059), and pc-ASPECTS (0.76; p=0.0127). Low disconnected fiber volume predicted better outcomes across the full mRS (OR=0.12 [95% CI, 0.065-0.204]) and greater benefit from successful recanalization (OR=0.33 [95% CI, 0.15-0.70]). In patients with NIHSS [≤]10 (n=102), disconnected fiber volume remained the strongest predictor (AUC=0.83). ConclusionsDisconnected fiber volume derived indirectly is a robust prognostic marker of BAO outcomes that outperforms conventional predictors and may support future treatment decisions. Registrationhttps://clinicaltrials.gov - NCT03776877.
Bechtel, G. N.; Das, A.; Noyer, J.; Bush, A. M.; Hormuth, D.; Yankeelov, T. E.; Castillo, E.; Warach, S.; Fuhg, J.; Tamir, J. I.; Saber, H.; Rausch, M. K.
Show abstract
Background and PurposeNeurointerventional outcomes depend on clot composition and may be influenced by clot contraction. Thus, a priori identification of clot composition and contraction could inform procedural strategies and improve outcomes. The goal of our work is to conduct an in vitro test to determine whether MRI can reliably predict both clot composition and contractile state. Materials and MethodsTo this end, we prepared blood clots spanning clinically observed compositions (0-80% red blood cells (RBCs)) in both contracted and uncontracted states. Contraction was controlled by coagulating blood with or without thrombin. We imaged these clots using quantitative, clinical, and investigational MRI sequences. Using these data, we then determined whether MRI signal intensities, quantitative parameters, and radiomic features capturing intensity and texture patterns can (i) predict clot hematocrit and (ii) classify clots by composition (RBC-rich vs. fibrin-rich) and contraction state. ResultsQuantitative MRI parameters (T1, T2, ADC) decreased with increasing hematocrit (R2 = 0.56-0.85, p < 0.001), while signal intensities from clinical sequences showed weaker correlations (R2 = 0.46-0.62, p < 0.001). Radiomic models predicted hematocrit with performance comparable to MRI parameters. When applied to classification, radiomic features accurately discriminated RBC-versus fibrin-rich clots, with AUCs exceeding 0.90 across nearly all sequences. In contrast, classification of contraction state showed greater variability in AUCs across sequences but remained high for quantitative T1 and T2 values (AUCs up to 0.88). Trends were consistent across clots coagulated with and without thrombin. Pooling features across sequences did not outperform the best individual sequence for either regression or classification. ConclusionsWe demonstrate that MRI-based radiomic analysis quantitatively characterizes clot composition and contraction in vitro. These findings support the feasibility of using MRI for pre-interventional clot phenotyping, with potential to inform thrombolytic and mechanical thrombectomy strategies. Thus, in vivo studies validating these results are warranted.
Tan, J.; Tang, P. H.
Show abstract
Background: Paediatric pneumonia is a leading cause of childhood morbidity and mortality worldwide. Chest X-rays (CXR) are an important diagnostic tool in the diagnosis of pneumonia, but shortages in specialist radiology services lead to clinically significant delays in CXR reporting. The ability to communicate findings both to clinicians and laypersons allows MLLMs to be deployed throughout clinical workflows, from image analysis to patient communication. However, MLLMs currently underperform state-of-the-art deep learning classifiers. Objective: To evaluate the diagnostic accuracy of ensemble strategies with MLLMs compared to the baseline average agent for paediatric radiological pneumonia detection. Methods: We conducted a retrospective cohort study using paediatric CXRs from two independent hospital datasets totalling 2300 CXRs. Fifteen MedGemma-4B-it agents independently classified each CXR into five pneumonia likelihood categories. Majority voting, soft voting, and GPTOSS-20B aggregation were compared against the average agent performance. The primary metric evaluated was OvR AUROC. Secondary metrics included accuracy, sensitivity, specificity, F1-score, Cohen's kappa, and OvO AUROC. Results: Soft voting achieved improvements in OvR AUROC (p_balanced = 0.0002, p_real-world = 0.0003), accuracy (p_balanced = 0.0008, p_real-world < 0.0001), Cohen's Kappa (p_balanced = 0.0006, p_real-world = 0.0054) and OvO AUROC (p_balanced < 0.0001, p_real-world = 0.0011) across both datasets, and a superior F1-value (pbalanced = 0.0028) for the balanced dataset. Conclusion: Soft voting enhances MedGemma's diagnostic discriminatory performance for paediatric radiological pneumonia detection. Our system enables privacy-preserving, near real-time clinical decision support with explainable outputs, having potential for integration into emergency departments. Our system's high specificity supports triage by flagging high-risk radiological pneumonia cases.
Agumba, J.; Erick, S.; Pembere, A.; Nyongesa, J.
Show abstract
Abstract Objectives: To develop and evaluate a deployable deep learning system with Gradient-weighted Class Activation Mapping (Grad-CAM) for tuberculosis screening from chest radiographs and to assess its classification performance and explainability across desktop and mobile deployment platforms. Materials and methods: This study used publicly available chest X-ray datasets containing Normal and Tuberculosis images. A DenseNet121-based transfer learning model was trained using stratified training, validation, and test splits with data augmentation and class weighting. Model performance was evaluated using accuracy, precision, recall, F1 score, receiver operating characteristic (ROC) curve, and area under the ROC curve (AUC). Grad-CAM was used to visualize regions influencing model predictions. The trained model was converted to TensorFlow Lite and deployed in both a Windows desktop application and a Flutter-based mobile application for offline inference and visualization. Results: The model demonstrated strong classification performance on the independent test dataset, with high accuracy and AUC values indicating effective discrimination between Normal and Tuberculosis cases. Grad-CAM visualizations showed that the model focused primarily on anatomically relevant lung regions, particularly the upper and mid-lung fields in Tuberculosis cases. Deployment testing confirmed consistent prediction outputs and Grad-CAM visualizations across both Windows and mobile platforms. Conclusion: The proposed deployable deep learning system with Grad-CAM provides accurate and interpretable tuberculosis screening from chest radiographs and demonstrates feasibility for offline mobile and desktop deployment. This approach has potential as an artificial intelligence-assisted screening and decision support tool in radiology, particularly in resource-limited and remote healthcare settings.
Hofmeister, J.; Bernava, G.; Rosi, A.; Brina, O.; Reymond, P.; Muster, M.; Lovblad, K.-O.; Machi, P.
Show abstract
Background: Even for experienced operators, endovascular treatment of unruptured intracranial aneurysms involves intraoperative uncertainty that may lead to adjustments in strategy, prolong the procedure, and potentially cause inefficiency and device waste. This study aimed to evaluate whether pre-procedural testing (PPT) of endovascular treatment using patient-specific models was associated with increased operator confidence and perceived clinical utility, including improvements in procedural efficiency and reduced resource waste. Methods: We enrolled a cohort of patients who underwent PPT before endovascular treatment for complex unruptured intracranial aneurysms and compared their outcomes with a control group treated without PPT. The primary outcome was the Training Fidelity Score, a composite of three operator-reported Likert items defined a priori. Secondary outcomes included perceived clinical utility, intraoperative strategy changes, procedural time, radiation exposure, device waste and safety. Results: A total of 85 patients met the inclusion criteria (PPT=40; control=45). The Training Fidelity Score was high across the PPT group (median, 4.33/5). Perceived clinical utility was high and further increased significantly after the procedure. A significant reduction was observed in intraoperative strategy changes, with no changes recorded in the PPT group, compared to 6/45 in the control group (RR 0.09; p=0.027). Reductions in treatment time, radiation exposure and device waste were also noted. Conclusion: PPT using patient-specific models was associated with increased operator confidence, fewer intraoperative strategy changes, improved procedural efficiency, and reduced device waste without compromising safety. These findings support its use in pre-interventional preparation, but require prospective multicenter validation.
Hue, J.; Yeo, J.; Saigo, L.
Show abstract
Objectives: Accurate assessment of orthognathic surgical accuracy is essential in the evaluation of operative techniques. Surgical accuracy is often reported as rotational and translational deviations from planned positions. This results in 6 separate values, translation in three planes, anterior-posterior (AP), superior-inferior (SI) and medial-lateral (ML) and rotations about three axes, pitch, roll and yaw. However, rotations will influence 3-dimensional positions and translational discrepancies. Methods: We have derived a mathematical formula using Euclidean geometry and quadratic functions that quantifies the impact of rotations on translational discrepancies. This allows for the calculation of a total discrepancy value that incorporates the three translations and rotations. Furthermore, we developed an interactive web-based application using the open-source shiny R package. Results: We have successfully reduced equations from Euclidean geometry into a quadratic form. The equation is as follows, [4(sin{theta}/2)2-2]x2 + [8d(sin{theta}/2)2-2d]x + 4d2(sin{theta}/2)2 = 0, where {theta} represents the rotational discrepancy in radians and d represents the translation discrepancy. This allows us to solve for the correction needed to be made to translational discrepancies to account for the influence of rotational discrepancies. We successfully developed a web application with a user-friendly graphical user interface. Clinicians upload their own data in the excel (.xlsx) file format and the application automatically performs the necessary calculations over many patients, returning a downloadable table of results. Conclusion: We present a mathematical formula incorporated into a web-application to combine translational and rotational discrepancies for deeper insight when evaluating orthognathic surgical accuracy. Clinical Relevance: This allows surgeons to account for rotational influence on 3-dimensional translational discrepancies.
Hou, J.; Yi, X.; Li, C.; Li, J.; Cao, H.; Lu, Q.; Yu, X.
Show abstract
Predicting response to induction chemotherapy (IC) and overall survival (OS) is critical for optimizing treatment in patients with locally advanced nasopharyngeal carcinoma (LANPC). This study aimed to develop and validate a multi-task deep learning model integrating pretreatment MRI and whole slide images (WSIs) to predict IC response and OS in LANPC. Pretreatment MRI and WSIs from 404 patients with LANPC were retrospectively collected to construct a multi-task model (MoEMIL) for the simultaneous prediction of early IC response and OS. MoEMIL employed multi-instance learning to process WSIs, PyRadiomics and a convolutional neural network (ResNet50) to extract MRI features, and fused multimodal features through a multi-gate mixture-of-experts architecture. Clustering-constrained attention multiple instance learning and gradient-weighted class activation mapping were applied for visualization and interpretation. MoEMIL effectively stratified patients into good and poor IC response groups, achieving areas under the curve of 0.917, 0.869, and 0.801 in the train, validation, and test sets, respectively, and outperformed the deep learning radiomics model, the pathomics model and TNM staging. The model also stratified patients into high- and low-risk OS groups (P < 0.05). MoEMIL shows promise as a decision-support tool for early IC response prediction and prognostication in LANPC. Author SummaryWe have developed a deep learning model that integrates two types of medical images, including magnetic resonance imaging (MRI) and digital pathological slices, to simultaneously predict response to induction chemotherapy and prognosis in patients with locally advanced nasopharyngeal carcinoma. Current treatment decisions primarily rely on traditional tumor staging (TNM), which often fails to comprehensively reflect the complexity of the disease. Our model, named MoEMIL, was trained and tested on data from 404 patients across two hospitals and consistently outperformed both single-model approaches and TNM staging methods. By identifying patients who exhibit poor response to induction chemotherapy or higher prognostic risk, our tool can assist clinicians in achieving personalized treatment, enabling intensified management for high-risk patients and avoiding unnecessary side effects for low-risk patients. Additionally, we visualize the models reasoning process through heat map generation, which highlights the image regions exerting the greatest influence on prediction outcomes. This work represents a step toward more precise treatment for nasopharyngeal carcinoma; however, larger-scale prospective studies are required before the model can be integrated into routine clinical practice.
Werner, C. J.; Meyer, T.; Pinho, J.; Mall, B.; Schulz, J. B.; Schumann-Werner, B.
Show abstract
Purpose: Neurogenic dysphagia is prevalent in neurological inpatients and associated with adverse outcomes, yet its independent economic impact after adjustment for frailty and functional status remains poorly quantified. We aimed to estimate the independent effect of dysphagia on hospital length of stay (LOS) and costs, to test whether this effect differs between geriatric and non-geriatric patients, and to quantify the probability and magnitude of cost savings from improvements in swallowing function. Methods: We analysed 10,375 neurological inpatient cases (2021-2024) at a German university hospital. Dysphagia was defined by fiberoptic endoscopic evaluation of swallowing (FEES) or ICD-10 R13 coding (n = 1,382; 13.3%). Bayesian Gamma-log regression with informative priors from historical data and published literature was used to model LOS and total case costs (German DRG), adjusted for age, sex, Hospital Frailty Risk Score (HFRS, R13-adjusted), self-care index ("Selbstpflege-Index", SPI), stroke status, and emergency admission. A geriatric cohort was defined as age >=70 and adjusted HFRS >=5 (n = 2,053; 19.8%). Posterior predictive simulation estimated cost savings for hypothetical improvements of 1-3 points on the Functional Oral Intake Scale (FOIS). Results: After comprehensive adjustment, dysphagia was independently associated with 46.5% longer LOS (posterior ratio 1.465; 95% credible interval [CrI] 1.397-1.537) and 28.2% higher total case costs (ratio 1.282; CrI 1.213-1.354). The dysphagia x geriatric interaction was small but credible and ran in opposite directions: slightly attenuated for LOS (interaction ratio 0.908, CrI 0.837-0.986) but slightly amplified for costs (1.096, CrI 1.012-1.185), consistent with complexity-driven DRG grouping in geriatric patients. The absolute economic burden remained larger in the geriatric cohort due to higher baseline costs. In the geriatric cohort, a one-point FOIS improvement yielded a 74.3% posterior probability of LOS-based savings (mean EUR 555/case); at three points, this rose to 84.2% (mean EUR 1,115/case). The direct cost model confirmed high benefit probabilities from the payer's perspective (82.6% at dFOIS = 3). Conclusions: Neurogenic dysphagia is an independent and substantial driver of hospital LOS and costs in neurological inpatients, even after adjustment for frailty and functional status. The proportional effect on costs is slightly larger in geriatric patients, while the LOS effect is slightly smaller, consistent with the mechanics of the G-DRG system. Bayesian simulation indicates that improvements in swallowing function carry a high probability of generating cost savings, supporting the characterisation of dysphagia as a modifiable economic target with particular relevance to geriatric neurology.
Sarwin, G.; Ricciuti, V.; Staartjes, V. E.; Carretta, A.; Daher, N.; Li, Z.; Regli, L.; Mazzatenta, D.; Zoli, M.; Seungjun, R.; Konukoglu, E.; Serra, C.
Show abstract
Background and Objectives: We report the first intraoperative deployment of a real-time machine vision system in neurosurgery, derived from our previous anatomical detection work, automatically identifying structures during endoscopic endonasal surgery. Existing systems demonstrate promising performance in offline anatomical recognition, yet so far none have been implemented during live operations. Methods: A real-time anatomy detection model was trained using the YOLOv8 architecture (Ultralytics). Following training completion in the PyTorch environment, the model was exported to ONNX format and further optimized using the NVIDIA TensorRT engine. Deployment was carried out using the NVIDIA Holoscan SDK, the system ran on an NVIDIA Clara AGX developer kit. We used the model for real-time recognition of intraoperative anatomical structures and compared it with the same video labelled manually as reference. Model performance was reported using the average precision at an intersection-over-union threshold of 0.5 (AP50). Furthermore, end-to-end delay from frame acquisition to the display of the annotated output was measured. Results: A mean AP50 of 0.56 was achieved. The model demonstrated reliable detection of the most relevant landmarks in the transsphenoidal corridor. The mean end-to-end latency of the model was 47.81 ms (median 46.57 ms). Conclusion: For the first time, we demonstrate that clinical-grade, real-time machine-vision assistance during neurosurgery is feasible and can provide continuous, automated anatomical guidance from the surgical field. This approach may enhance intraoperative orientation, reduce cognitive load, and offer a powerful tool for surgical training. These findings represent an initial step toward integrating real-time AI support into routine neurosurgical workflows.
Dell'Orco, A.; De Vita, E.; D'Arco, F.; Lange, A.; Rüber, T.; Kaindl, A. M.; Wattjes, M. P.; Thomale, U. W.; Becker, L.-L.; Tietze, A.
Show abstract
Focal cortical dysplasias (FCDs) are one of the most common structural causes of drug-resistant epilepsy in children but are frequently subtle and difficult to detect on conventional MRI. Many automated lesion detection methods have therefore been proposed to support neuroradiological assessment. In this study, we externally validated two recently developed deep-learning approaches for FCD detection, MELD Graph and 3D-nnUNet, in a pediatric cohort. In this retrospective single-center study, brain MRI scans of 71 children evaluated for epilepsy were analyzed, including 35 MRI-positive patients with suspected FCD and 36 MRI-negative cases based on the primary radiology reports. Both models were applied to standard 3D T1-weighted and 3D FLAIR images. Detected lesions were reviewed by an experienced pediatric neuroradiologist and classified as true positive, false positive, or false negative. Clinical semiology and EEG findings were additionally evaluated for cases with false-positive detections. At the lesion level, MELD Graph achieved a precision of 0.85 and recall of 0.52, while 3D-nnUNet achieved a precision of 0.91 and recall of 0.48. In the MRI-negative patients, MELD Graph produced more false-positive detections than 3D-nnUNet (0.53 vs. 0.14 false-positive lesions per patient). At the patient level, MELD Graph showed slightly higher sensitivity than 3D-nnUNet (0.63 vs. 0.54), whereas 3D-nnUNet demonstrated markedly higher specificity (0.86 vs. 0.56). Improved FLAIR image quality was associated with trends toward improved model performance. Both models demonstrated high precision but moderate sensitivity, indicating that they are valuable decision-support tools but cannot replace expert neuroradiological evaluation. Optimized MRI acquisition protocols are needed to further improve automated lesion detection in pediatric epilepsy.
Sivakumar, E.; Anand, A.
Show abstract
Computer vision and deep learning techniques, including convolutional neural networks (CNNs) and transformers, have increased the performance of medical image classification systems. However, training deep learning models using medical images is a challenging task that necessitates a substantial amount of annotated data. In this paper, we implement data augmentation strategies to tackle dataset imbalance in the VinDr-SpineXR dataset, which has a lower number of spine abnormality X-ray images compared to normal spine X-ray images. Geometric transformations and synthetic image generation using Generative Adversarial Networks are explored and applied to the abnormal classes of the dataset, and classifier performance is validated using VGG-16 and InceptionNet to identify the most effective augmentation technique. Additionally, we introduce a hybrid augmentation technique that addresses class imbalance, reduces computational overhead relative to a GAN-only approach, and achieves ~99% validation accuracy with both classifiers across all three case studies. Keywords: Data augmentation, Generative Adversarial Network, VGG-16, InceptionNet, Class imbalance, Computer vision, Spine X-ray, Radiology.
Gangolli, M.; Perkins, N. J.; Marinelli, L.; Basser, P. J.; Avram, A. V.
Show abstract
BACKGROUNDMild traumatic brain injury (mTBI) is a signature injury in civilian and military populations that remains invisible to detection by conventional radiological methods. Diffusion MRI has been identified as a potential clinical tool for revealing subtle microstructural alterations associated with mTBI. OBJECTIVEThis study evaluates whether a comprehensive and powerful diffusion MRI (dMRI) technique called mean apparent propagator (MAP) MRI can detect sequelae of mTBI. METHODSWe analyzed data from 417 participants of the GE/NFL prospective mTBI study which included 143 matched controls (mean age, 21.9 {+/-} 8.3 years; 76 women) and 274 patients with acute mTBI and GCS [≥]13 (mean age, 21.9 {+/-} 8.5 years; 131 women). All participants underwent MRI exams at up to four visits including structural high-resolution T1W, T2W, FLAIR-T2W, and dMRI, in addition to clinical assessments of post-concussive physical symptoms (RPQ-3), psychosocial functioning and lifestyle symptoms (RPQ-13), and postural stability (BESS). The dMRI data for each subject were co-registered across all visits and analyzed using the MAP-MRI framework to measure and map the distribution of net microscopic displacements of diffusing water molecules in tissue and ultimately compute the microstructural MAP-MRI tissue parameters including propagator anisotropy (PA), Non-Gaussianity (NG), return-to-origin probability (RTOP), return-to-axis probability (RTAP), and return-to-plane probability (RTPP). We quantified voxel-wise and region-of-interest (ROI)-based changes in these parameters across all four visits. RESULTSMAP-MRI parameter values were within the expected ranges and showed relatively little variation across visits. We found no significant differences in the longitudinal trajectories of these parameters between mTBI patients and controls. At acute post-injury timepoints, RPQ-3 and RPQ-13 scores were increased in mTBI patients relative to controls, while BESS scores were not significantly different between groups. Analysis of dMRI metrics and clinical mTBI markers showed significant correspondence between MAP-MRI metrics in cortical gray matter, caudate and pallidum and BESS scores. CONCLUSIONWe developed and tested a state-of-the-art quantitative image processing pipeline for sensitive analysis and detection of subtle tissue changes in longitudinal clinical diffusion MRI data. The absence of a significant statistical difference between populations in the dMRI parameters in this study suggests that the mTBI corresponded to acute post-injury clinical symptoms but that the injury was not severe enough to cause detectable microstructural damage/alterations, and that increased diffusion sensitization combined with improved analysis techniques may be needed. CLINICAL IMPACTThese findings suggest that acute mTBI (GCS[≥]13) may not be detectable with diffusion MRI. TRIAL REGISTRATIONClinicalTrials.gov NCT02556177
Altinok, O.; Ho, W. L. J.; Robinson, L.; Goldgof, D.; Hall, L. O.; Guvenis, A.; Schabath, M. B.
Show abstract
Objectives: Among surgically resected non-small cell lung cancer (NSCLC) patients with similar stage and histopathological characteristics, there is variability in patient outcomes which highlights urgency of identifying biomarkers to predict recurrence. The goal of this study was to systematically develop a pre-surgical CT-based habitat-based radiomics classifier to predict recurrence-of-risk in NSCLC. Methods: This study included 293 NSCLC patients with surgically resected stage IA-IIIA disease that were randomly divided into a training (n = 195) and test cohorts (n = 98). From pre-surgical CT images, tumor habitats were generated using two-level unsupervised clustering and then radiomic features were calculated from the intratumoral region and habitat-defined subregions. Using ridge-regularized logistic regression, separate classifiers were developed to predict 3-year recurrence using intratumoral radiomics, habitat-based radiomics, and a combined model (intratumoral and habitat) which was generated using a stacked learning framework. For each classifier, probability of recurrence was calculated for each patient then numerous statistical and machine learning approaches were utilized to stratify patients for recurrence-free survival. Results: The combined radiomics classifier yielded a superior AUC (0.82) compared to the intratumoral (AUC = 0.75) and habitat radiomics (AUC = 0.81) models. When the classifiers were used to stratify high- versus low-risk patients utilizing a cut-point identified by decision tree analysis, high-risk patients were yielded the largest risk estimate (HR = 8.43; 95% CI 2.47 - 28.81) compared to the habitat (HR = 5.41; 95% CI 2.08 - 14.09) and intratumoral radiomics (HR = 3.54; 95% CI 1.45 - 8.66) models. SHAP analyses indicated that habitat-derived information contributed most strongly to recurrence prediction. Conclusions: This study revealed that habitat-based radiomics provided superior statistical performance than intratumoral radiomics for predicting recurrence in NSCLC.
Romano, D. J.; Roberts, A. G.; Weppner, B.; Zhang, Q.; John, M.; Hu, R.; Sisman, M.; Kovanlikaya, I.; Chiang, G. C.; Spincemaille, P.; Wang, Y.
Show abstract
Purpose: To develop a deep neural network-based, AIF-free, perfusion estimation method (QTMnet) for improved performance on glioma classification. Methods: A globally defined arterial input function (AIF) is needed to recover perfusion parameters in the two-compartment exchange model (2CXM). We have developed Quantitative Transport Mapping (QTM) to create an AIF-independent estimation method. QTM estimation can be formulated using deep neural networks trained on synthetic DCE-MRI data (QTMnet). Here, we provide a fluid mechanics-based DCE-MRI simulation with exchange between the capillaries and extravascular extracellular space. We implemented tumor ROI generation to morphologically characterize tissue perfusion. We compared our QTMnet implementation with 2CXM on 30 glioma human subjects, 15 of which had low-grade gliomas, and 15 with high-grade glioblastomas. Results: QTMnet outperforms (best AUC: 0.973) traditional 2CXM (best AUC: 0.911) in a glioma grading task. Conclusion: The AIF-independent QTMnet estimation provides a quantitative delineation between low-grade and high-grade gliomas.
Xu, R.; Dou, H.; Zhang, M.; Liu, Z.
Show abstract
Background: To investigate the safety and efficacy of CTguided lung nodule localization needles for the preoperative localization of small pulmonary nodules. Methods: A retrospective study was conducted on 102 patients with a total of 113 small pulmonary nodules who underwent preoperative localization at Jinan Fourth People's Hospital from January 2024 to December 2025. Nodule diameter and depth, localization time, the number of pleural punctures, the localization success rate, and postoperative complications (hook dislodgement, hemorrhage, and pneumothorax) were recorded. All patients underwent video assisted thoracoscopic surgery (VATS) after localization. Results: The mean nodule diameter was 0.97{+/-}0.36 cm, the mean depth was 1.26{+/-}0.48 cm, and the mean localization time was 9.8{+/-}3.65 minutes. The hook dislodgement rate was 0.98% (1/102), the intrapulmonary hemorrhage rate was 14.71% (15/102), and the pneumothorax rate was 16.67% (17/102). All pulmonary nodules were successfully resected by VATS at 73.82{+/-}13.83 minutes after localization, and no severe complications occurred. Conclusions: The use of a CTguided lung nodule localization needle for the preoperative localization of small pulmonary nodules decreases the time needed for intraoperative nodule detection and operation time. This strategy is a simple, safe, and accurate preoperative localization method that is worthy of increased clinical use.
Heine, J.; Fowler, E.; Egan, K.; Weinfurtner, R. J.; Balagurunathan, Y.; Schabath, M. B.
Show abstract
A substantial body of evidence demonstrates that measures from mammograms are predictive of breast cancer risk. In this matched case-control study, mammograms acquired near the time of diagnosis were analyzed to investigate bilateral breast asymmetry as measure of short-term risk prediction. Specifically, contralateral breast images were compared with measures derived in the Fourier domain (FD); this technique summarizes power in concentric radial bands that cover the Fourier plane. Equivalently, this approach can be described as a multiscale characterization of the image. The summarized power difference between respective contralateral bands produces an asymmetry measure. Full field digital mammography (FFDM) and synthetic two-dimensional images from digital breast tomosynthesis (DBT) were investigated for women that had both types of mammograms acquired at the same time. Odds ratios (ORs) and the area under the receiver operating curves (Azs) were generated from conditional logistic regression modeling with 95% confidence intervals. Raw unprocessed FFDM images produced significant findings: OR = 1.90 (1.58, 2.29) and Az = 1.72 (0.67, 0.76) per one standard deviation unit. Associations were significant but attenuated for both clinical FFDM and DBT images: OR = 1.31 (1.11, 1.54) and Az = 0.63 (0.58, 0.67); and OR = 1.48 (1.25, 1.76) and Az = 0.65 (0.60, 0.70), respectively. Results suggest that clinical FFDM and DBT images are inferior to raw FFDM images in capturing breast asymmetry with information loss for breast cancer risk prediction. Moreover, these DBT images have lower spatial resolution but produced stronger associations than the clinical FFDM images.
Wen, X.; Sun, Y.; Zhou, X.; Li, Y.; Paez, A.; Varghese, J.; Pillai, J. J.; Knutsson, L.; Van Zijl, P. C. M.; Leigh, R.; Kamson, D. O.; Graley, C. R.; Saidha, S.; Bakker, A.; Ward, B. K.; Kashani, A. H.; Hua, J.
Show abstract
Background: Recently, a posterior pathway for fluid drainage from the retina to the meningeal lymphatics in the optic nerve (ON) sheath was identified in rodents using intravitreal imaging tracers directly injected into the ocular-globe. Fluid and solute clearance along this pathway may be associated with many diseases. However, intravitreal tracers are rarely used in clinical imaging. As intravenous Gadolinium-based-contrast-agent (GBCA) can enter the globe via the blood-ocular-barriers, it may provide an alternative approach to image this pathway. Purpose: To establish a clinically feasible intravenous GBCA-based MRI approach for tracking fluid and solute transport along the posterior lymphatic pathway in the ocular glymphatic system. Materials & Methods: This prospective study was conducted from March 2021 to September 2022 in healthy participants. Dynamic-susceptibility-contrast-in-the-CSF (cDSC) MRI was performed before, immediately and 4 hours after intravenous-GBCA administration to track GBCA distribution in aqueous humor (AH) and cerebrospinal fluid (CSF) in regions-of-interest (ROIs) in the globe (anterior-cavity, vitreous-body), in the intraorbital and extraorbital ON, and in the intracranial CSF space proximal to the ON (chiasmatic-cistern, interpeduncular-cistern). Kruskal-Wallis tests with post-hoc Dunn's tests were used for group comparisons. Results: Sixteen healthy participants (mean age +/- SD: 51 +/- 21 years, 5 men) were recruited. Intravenous-GBCA enhancement was observed in all ROIs immediately after injection. At 4-hour-post-GBCA, the vitreous body showed a trend of smaller enhancement area (55 +/- 11% versus 49 +/- 11%, P=.14) and lower GBCA-concentration (0.044 +/- 0.014 versus 0.028 +/- 0.010 mmol/L, P=.07) compared to immediate-post-GBCA. The intraorbital ON showed more widespread enhancement (39 +/- 5% versus 59 +/- 6%, P=.01) and significantly higher GBCA-concentration (0.023 +/- 0.009 versus 0.059 +/- 0.015 mmol/L, P<.001) at 4-hour-post-GBCA. Conclusion: Dynamic fluid and solute transportation along the posterior lymphatic pathway in the ocular glymphatic system in healthy participants was measured by tracking intravenous-GBCAs entering the globe via the blood-ocular-barriers using cDSC-MRI.